
# High-precision image-text matching

Vit SO400M 14 SigLIP2 378
Apache-2.0
A SigLIP 2 vision-language model trained on the WebLI dataset, supporting zero-shot image classification.
Text-to-Image
timm
1,596
1
Spec Vision V1
MIT
Spec-Vision-V1 is a lightweight, state-of-the-art open-source multimodal model designed for deep integration of visual and textual data, supporting a 128K context length.
Text-to-Image, Transformers, Other
SVECTOR-CORPORATION
17
1
Vit L 14 CLIPA 336 Datacomp1b
Apache-2.0
CLIPA-v2, an efficient contrastive image-text model focused on zero-shot image classification.
Text-to-Image
UCSC-VLAA
239
2
Vit B 16 SigLIP
Apache-2.0
SigLIP (Sigmoid Loss for Language Image Pre-training) model trained on the WebLI dataset for zero-shot image classification tasks.
Text-to-Image
timm
27.77k
31
© 2025 AIbase